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Abstract. Let F be a structure with a finite relational signature and a first-order def- 
inition in (R; *,-!-) with parameters from R, that is, a relational structure over the real 
numbers where all relations are semi-algebraic sets. In this article, we study the compu- 
tational complexity of constraint satisfaction problem (CSP) for F: the problem to decide 
whether a given primitive positive sentence is true in F. We focus on those structures F 
that contain the relations <, {(i, y,z) \ x-\-y — z} and {1}. Hence, all CSPs studied in this 
article are at least as expressive as the feasibility problem for linear programs. The central 
concept in our investigation is essential convexity: a relation S is essentially convex if for 
all a,b £ S, there are only finitely many points on the line segment between a and 6 that 
are not in S. If F contains a relation 5* that is not essentially convex and this is witnessed 
by rational points a, b, then we show that the CSP for F is NP-hard. Furthermore, we 
characterize essentially convex relations in logical terms. This different view may open up 
new ways for identifying tractable classes of semi-algebraic CSPs. For instance, we show 
that if F is a first-order expansion of (R;+,l,<), then the CSP for F can be solved in 
polynomial time if and only if all relations in F are essentially convex (unless P=NP). 
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1. Introduction 

Linear Programming is a computational problem of outstanding theoretical and practical 
importance. It is known to be computationally equivalent to the problem to decide whether 
a given set of linear (non-strict) inequalities is feasible, i.e., defines a non-empty set: 

Linear Program Feasibility 

INPUT: A finite set of variables V; a finite set of linear inequalities of the form aixi + • • • + 
o,kXk < ao where xi, . . . , Xk £ V and ao, . . . ,ak are rational numbers where the numerators 
and denominators are represented in binary. 

QUESTION: Does there exist an x G RI^I that satisfies all inequalities? 



This problem can be viewed as a constraint satisfaction problem, where the allowed 
constraints are linear inequalities with rational coefficients, and the question is whether 
there is an assignment of real values to the variables such that all the constraints are 
satisfied. For formal definitions of concepts related to constraint satisfaction, we refer the 



reader to Section 2.1 It is obvious that this problem cannot be formulated with a finite 



constraint language; however, we will later on (Proposition [2.12 ) see that the feasbility 
problem for linear programs is polynomial-time equivalent to the constraint satisfaction 
problem for the structure 

Piin := (M; {{x,y,z)\x + y = z}, <, {!}) . 

It is well-known that linear programming can be solved in polynomial time; moreover, 
several algorithms are known that are efficient also in practice. In this article, we study 
how far Piin can be expanded such that the corresponding constraint satisfaction problem 
remains polynomial-time solvable. An important class of relations that generalizes the class 
of relations defined by linear inequalities is the class of all semi-algebraic relations, i.e., 
relations that have a first-order definition over (M; *, -|-) using parameters from M. By the 
fundamental theorem of Tarski and Seidenberg, it is known that a relation S C M" is semi- 
algebraic if and only if it has a quantifier- free first-order definition in (M; *,+,<) using 
parameters from M. Geometrically, we can view semi- algebraic sets as finite unions of finite 
intersections of the solution sets of strict and non-strict polynomial inequalities. 

We propose a framework for systematically studying the computational complexity of 
expansions of T\[^ by semi-algebraic relations. In this framework, a constraint satisfaction 
problem is given by a (fixed and finite) constraint language T. All the constraints in 
the input of such a feasibility problem must be chosen from this constraint language F 
(a formal definition can be found in Section 2.1). This way of parameterizing constraint 
satisfaction problems by their constraint language has proved to be very fruitful for finite 
domain constraint satisfaction problems [I|[7[|9|[l4] . Since the constraint language is finite, 
the computational complexity of such a problem does not depend on how the constraints 
are represented in the input. We believe that the very same approach is very promising for 
studying the complexity of problems in real algebraic geometry. In Section [6] we will discuss 
a connection between some of the CSPs with semi-algebraic constraint languages and open 
problems in convex geometry and semidefinite programming. 

One of the key reasons why linear program feasibility can be decided in polynomial time 
is that the feasible regions of linear inequalities are convex. Convexity is not a necessary 
condition for tractability of semi- algebraic constraint satisfaction problems, though. It is, 
for instance, well-known that linear program feasibility can also be decided in polynomial 
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time when some of the input constraints are disequalities, i.e., constraints of the form 
aixi + • • • + afcXfc 7^ oq for rational values oq, . . . , a^. However, we show that if T^n ^ T 
and r contains a relation S with rational a,b £ S such that on the line segment L between 
a and b there are infinitely many points that are not in S, then the CSP(r) is NP-hard. 
This motivates the notion of essential convexity: a set 5 C M*^ is essentially convex if for all 
p,q £ S there are only finitely many points on the line between p and q that are not in S. 
One of our central results is a logical characterization of essentially convex semi- algebraic 
relations in Section [H This characterization can be used to show several results that are 
briefly described next. 

A relation is called semi-linear if it has a first-order definition with rational parameter^ 
in the structure (M; +, <). From the perspective of constraint satisfaction, the set of semi- 
linear relations is a rich set. For example, every relation S C with finitely many elements 
is semi-linear; thus, every finitary relation on a finite set can be viewed as a semi-linear 



relation. In Section 5.1, we show that when we add a finite number of semi-linear relations 
to Fiin, then the resulting language either has a polynomial-time or an NP-hard constraint 
satisfaction problem. This result is useful for studying optimization problems: note that 
linear programming can be viewed as optimizing a linear function over the feasible points 
of a set of linear inequalities. This view suggests an immediate generalization: optimize a 
linear function over the feasible points of an instance of a constraint satisfaction problem 
for semi-linear constraint languages. We completely classify the complexity of this problem 
in Section [121 

Another application concerns temporal reasoning. A temporal constraint language T 
is a structure (M; Ri, . . . , Ri) with a first-order definition in (M; <). Many computational 
problems in artificial intelligence and scheduling can be modeled as constraint satisfaction 
problems for temporal constraint languages. The complexity of the CSP for temporal con- 
straint languages F has been completely classified recently |6|; there are 9 tractable classes 
of temporal constraint satisfaction problems. Often, temporal languages are extended with 
some mechanism for expressing metric time, i.e., the ability to assign numerical values to 



variables and performing some kind of arithmetic calculations 11 . It has been observed 
that many metric languages F are semi-linear and satisfy Fun C F, and if such a language 
is polynomial-time solvable, then it is a subclass of the so-called Horn-DLR class |21| . Our 
result shows that this is not a coincidence: whenever F is not a subclass of Horn-DLR, then 
the CSP(F) is NP-hard. 



2. Preliminaries 



2.1. Constraint Satisfaction Problems. A first-order formul^is called primitive posi- 
tive (pp) if it is of the form 

3xi, . . .,Xn-{lpl A • • • A Ipm) 

^We deviate from model-theoretic terminology as it is used e.g. in [24| in that we only allow rational 
and not arbitrary real param eters i n first-order definitions. Our definition conincides with the definition of 
semi-linear sets given in e.g. 



12 



13 



Our terminolo gy i s standard; all notions that are not explicitly introduced can be found in standard 



textbooks, e.g., in 19 
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where ipi are atomic formulas, i.e., formulas of the form x = y or S{xi-^ , ■ ■ ■ , Xi^) where S is 
the relation symbol for a k-aiy relation in T. We call such a formula a pp-formula, and as 
usual a pp-formula without free variables is called a pp-sentence. 

Let r = {D; Si, . . . , S*;) be a structure with domain D and a finite relational signature. 
The constraint satisfaction problem for T (CSP(r) in short) is the computational problem 
to decide whether a given primitive positive sentence $ involving relation symbols for the 
relations in F is true in T. The conjuncts in a pp-sentence ^ are also called the constraints of 
^, and to emphasize the connection between the structure T and the constraint satisfaction 
problem, we typically refer to F as a constraint language. By choosing an appropriate con- 
straint language F, many computational problems that have been studied in the literature 
can be formulated as CSP(F) (see e.g. pi9]). 

When studying the complexity of different CSPs, it is often useful to be able to derive 
new relations from old. If F = {D; Si, . . . , Si) is a relational structure and S C is 
a relation, then (F, S) denotes the expansion {D; S, Si, . . . , Si) of the structure F by the 
relation S. We say that an n-ary relation S is pp-definable in F if there exists a pp-formula 
(p with free variables xi,...,Xn such that (xi,...,x„) € 5 iff (j){xi, . . . ,Xn) holds in F. 
The following simple but important result explains the importance of pp-definability for 
constraint satisfaction problems. 

Lemma 2.1 (Jeavons et al. |20|). Let T be a relational structure, and let S be pp-definable 
over F. Then CSP((F, 5)) is polynomial-time equivalent to CSP(F). 

2.2. Semi-algebraic and semi-linear relations. We say that a relation S C is first- 
order definable in a structure F with domain D if there exists a formula (/'(xi, . . . , x„) using 
universal and existential quantification, disjunction, conjunction, negation, and atomic for- 
mulas over r (where xi, . . • , Xn denote the free variables in (j)) such that 4>{ai, . . . , o„) is 
true over F if and only if (ai, . . . , a„) G 5. We always admit equality when building atomic 
formulas, i.e., we have atomic formulas of the form ti = t2 for terms ii,t2 formed from 
function symbols for F and variables. We say that S is first-order definable in F with pa- 
rameters from A, for A <Z D, additionally we can use constant symbols for the elements 
of A in the first-order definition of S. 

A set S C is called semi- algebraic if it has a first-order definition in (M; *, +) using 
parameters from M. Note that the order < of the real numbers is first-order definable in 
(M; *, +), since 

a < b 3c. b = a + c * c . 
We need some basic algebraic and topological concepts and facts. 

Definition 2.2 (Section 3.1 in [2]). A set 5 C is open if it is the union of open balls, 
i.e., if every point of S is contained in an open ball contained in S. A set S C M" is closed 
if its complement is open. The closure of a set S, denoted S, is the intersection of all closed 
sets containing S. Equivalently, S = {x £ M" \ \/r > 3y £ S. {y — x)^ < r^}. A point p in 
5 is a boundary point if for every e > 0, the n-dimensional open ball with radius e around 
p contains at least one point in S and one point not in S. The set of boundary points is 
denoted dS. The interior of S, denoted by 5°, is S" \ dS. 

Note that the interior of S consists of all p G S" such that there exists an e > with the 
following property: the n-dimensional open ball with radius e around p is contained in S. 
Also note that a finite union of closed sets is closed. 
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Proposition 2.3 (Proposition 2.2.2. in |4j). The closure of a semi-algebraic relation is 
semi- algebraic. 

We use tlie notion of dimension dim(S') € N of a semi-algebraic set S as defined in j4]. 

Definition 2.4 (Section 2.8 in Q). Let C M'^ be a semi- algebraic set, and let V{S) 
be the ring of polynomial functions on S, i.e., the ring of functions S — t- M which are the 
restriction of a polynomial. Then the dimension of S, denoted by dim(S'), is the maximal 
length of chains of prime ideals of V{S), i.e., the maximal d such that there exist distinct 
prime ideals /q, /i, . . . , of ViS) with Iq C I2 C • • • C Id- 

To work with this definition of dimension, we need some more concepts. 

Definition 2.5 (see [4]). Let 5 C and T C M' be semi- algebraic sets. A function 
f:S^Tis semi- algebraic if the set {{xi, . . . , Xk,yi, ■ ■ ■ ,yi) \ f{xi,...,Xk) = {yi,...,yi)} 
is a semi-algebraic subset of M*^"^' . 

As usual, bijective functions f : S ^ T such that S" C 5 is open if and only if f{S') C T 
is open are called homeomorphisms. 

Lemma 2.6 (Propositions 2.8.5, 2.8.9, and 2.8.13 in |4j). Let S (^W be semi- algebraic. 

• If S = SiU S2 then dim(5) = max(dim(S'i), dim(S'2)). 

• // there is a semi-algebraic homeomorphism from S to (0, l)'', then dim(S') = d. 

• dim{S\S) < dim(5). 

In particular, if S" C T, then dim(S') < dim(T). 

A set V C M" is called an (algebraic) variety if it can be defined as a conjunction of 
the form pi = A • • • Apm = where pi, . . . ,Pm are polynomials in the variables xi, . . . ,Xn 
with coefficients from M. We allow terms in polynomials to have degree zero. 

Lemma 2.7. Let V CW^ be a variety and let L C M" 6e o line. If infinitely many points 
of L are in V , then L <^V . 

Proof. Let V be defined by pi{xi, . . . , Xn) = A • • • A Pm{xi, . . • , x„) = 0, and let Zi, . . . , /„ 
be univariate linear polynomials such that L = {(/i(x), . . . , Inix)) \ x G M}. For each pi, the 
univariate polynomial pi{li{x), . . . , ln{x)) equals infinitely often. So it is always 0, and it 
follows that every point on L satisfies pi = A • • • A pm = 0. □ 

Theorem 2.8 (Tarski and Seidenberg; Proposition 5.2.2 in [4|). Every first- order formula 
over (]R;*,-|-,<) with parameters from M is equivalent to a quantifier-free formula with 
parameters from M. 

By an interval we mean either an open, half-open, or closed interval with more than 
one element. An ordered structure (£);<,...) is o-minimal (see [23] , Definition 3.1.18) 
if for any first-order definable S Q D with parameters from D there are finitely many 
intervals Ii, . . . ,/m with endpoints in U {±00} and a finite set -Do ^ D such that S = 
Do U Ii U • • • U Im • The following is an easy and well-known consequence of Theorem |2.8[ 

Theorem 2.9 (see e.g. [23]). Let Ri,...,Rn be semi- algebraic relations. Then (M; < 
, . . . , Rn) is o-minimal. 

A set S C M"- is called semi-linear if it has a first-order definition in (M; +, <) with 
parameters from Q; we also call first-order formulas over (M; +, <) with parameters from 
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Q semi-linear. It has been shown in 12,13] that it is decidable whether a given first- 
order formula over (M; *, +, <) with parameters from Q defines a semi-hnear relation or 
not. A set V C M" is called a linear set if it can be defined as a conjunction of the form 
Pi > A • • • Apm > where pi, ■ ■ ■ ,Pm are linear polynomials in the variables xi, . . . ,Xn with 
coefficients from Q. It is not hard to see that every semi- linear relation S can be viewed as 
a finite union of linear sets. We also have quantifier elimination for semi-linear relations. 



Theorem 2.10 (Ferrante and Rackoff [15]). Every semi-linear relation has a quantifier-free 
definition over (M; +, — , <) with parameters from Q. 



2.3. Definability of Rational Expressions. The following elementary lemma will be 
needed for the observation that the feasibility problem for linear programs is polynomial- 
time equivalent to CSP(riin); it is also essential for the hardness proofs in Section [s] and for 
proving the dichotomy result for metric temporal constraint reasoning. 

Lemma 2.11. Let no,ni, . . . ,ni G Q be rational numbers. Then the relation {{xi, . . . ,xi) \ 
nixi + • • • + nixi = no} is pp-definable in (M; {(x, y,z) \ x -\- y = z}, {1}). Furthermore, the 
pp-formula that defines the relation can be computed in polynomial time. 

Proof. We first note that we can assume that no, ni, . . . , n; are integers. To see this, suppose 
that the rational coefficients no, . . • , n; are represented as pairs of integers (oq, bo), ... , (a;, bi) 
where Oj denotes the nominator and bi the denominator. Let c = Y[\=o ^* ^^"^ create a new 
sequence of coefficients n^, . . . ,n[ = (ap • c/6o, 1), • • • , (a« • c/fe;, 1). The resulting equation 
is obviously equivalent. It is also clear that it only takes polynomial time to compute such 
coefficients. 

Before the actual proof, we note that x = is pp-definable hy x + x = x, and we 
therefore freely use the terms and 1 in pp-de&iitions. Similarly, x = — 1 is pp-definable 
by X H- 1 = 0. The proof is by induction on /. We first show how to express equations of the 
form nixi + n2X2 = X3. By setting X2 to — 1 and X3 to 0, this will solve the case / = 1. For 
positive ni, n2, the formula nixi + n2X2 = X3 is equivalent to 

ni —1 

3ui, . . . ,Uni,Vi, . . . ,Vn2- Itl = Xi A f\ Xi + Ui = Ui+i 

1=1 
n2 — l 

AVi= X2A f\ X2+Vi = Vi+i 
i=l 

A tt„i + Vn2 = X3 . 

However, this formula is exponential in the representation size of ni and n2, and cannot be 
used in polynomial-time reductions. 

Let bit{n,i) denote the i-th lowest bit in the binary representation of an integer n and 
1 < * < l^og nj + 1 . The following formula is equivalent to the previous one (we are still in 
the case that both ni and n2 are positive) and has polynomial length in the representation 
size of Ul and n2. Write mi for [lognij + 1 and m2 for [logn2j + 1. 
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mi 



3ai, ami,bi, 6^21 ci, Cmi,di, dm2- ai = xi A ^ aj_i + ai-i = at 

i=2 

7712 



i=2 

mi 



A ci = bit{ni, l)ai A ^ bit{ni,i)ai + Ci_i 



m2 

A di = bit{n2, l)bi A ^ bit{n2,i)bi + dj-i = 

Cmi ~l" '^m2 — -^3 

If / = 2, and ni = or n2 = 0, then the proof is similar. If ni and n2 have different 
signs, we replace the conjunct Cmi + dm2 = in the formula above appropriately by 
Cmi + = dm2 or dm2 + X3 = Cmi ■ If both rii and n2 are negative, then we use the 
pp-definition 3x'^.{—niXi — 71-2X2 = x'^ A x'^ + = 0). 

Equalities of the form niXi + n2X2 = uq can be defined by 3x3.(711X1 + 712X2 = X3 AX3 = 
no). Now suppose that / > 2. By the inductive assumption, there is a pp-definition 
01 (xi, X2, u) for TiiXi + 77.2X2 + u = riQ and a pp-definition (/)2(x3, . . . , x;, li) for 713X3 + • • • + 
nixi = u. Then 3u.{4)i A (P2) is a pp-definition for tiiXi + • • • + nixi = hq. It is clear that 
the pp-definition given above can be computed in time which is polynomial in the number 
of bits needed to represent the input. □ 

By extending the previous result to inequalities, we prove that CSP(riin) and linear 
program feasibility are polynomial-time equivalent problems. The dichotomy for metric 



temporal reasoning follows immediately by combining this result and Theorem 5.2 



Proposition 2.12. The linear program feasibility problem is polynomial-time equivalent to 

csp(riin). 

Proof. It is clear that an instance of CSP(riin) can be seen as a linear program feasibility 
problem, since the three different relations in the constraint language, x + y = 2:, x = l, 
X < y, are linear. 

For the opposite direction, let $ be an arbitrary instance of the linear program feasibility 
problem. Given a linear equality L(xi, . . . , x^) = cixi+- • ■+CkXk = cq, let 4>L{xi,...,Xk) denote 



the pp-definition of L{xi, . . . , x^) in (M; {(x, y,z) \ x + y = z}, {1}) obtained in Lemma 2.11 
Construct an instance ^ of CSP(riin) by replacing each occurrence of a linear inequality 

constraint cixi + • • • qx; < cq in ^ by a (pcixiA hcixi-y=o ^ y 1^ cq; use fresh variables for 

y and for the existentially quantified variables introduced by (p^. The resulting formula ^ 
can be rewritten as a primitive positive sentence over T without increasing its length and. 



by Lemma 2.11, the length of ^ is polynomial in the length of Since $ is satisfiable if 



and only if ^ is satisfiable, this shows that the problems are polynomial-time equivalent. O 
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3. Hardness 

We consider relations that give rise to NP-hard CSPs in this section. We first need some 
definitions: a relation S* C M*^ is convex if for all p,q € S, S contains all points on the line 
segment between p and q. We say that a relation S" C M'^ excludes an interval if there are 
p,q £ S and real numbers < 6i < 62 < I such that p+ {q — p)y S whenever 5i < y < 82- 
Note that we can assume that di,52 are rational numbers, since we can choose any two 
distinct rational numbers 71 < 72 between 61 and ^2 instead of 61 and 62- 

Definition 3.1. We say that S C M" is essentially convex if for all p,q €z S there are only 
finitely many points on the line segment between p and q that are not in S. 

If S is not essentially convex, and if p and q are such that there are infinitely many 
points on the line segment between p and q that are not in S, then p and q witness that S 



is not essentially convex. The following is a direct consequence of Theorem 2.9 and we will 
use it in the following without further reference. 

Corollary 3.2. If S is a semi-algebraic relation that is not essentially convex, then S 
excludes an interval. If S is an essentially convex semi- algebraic relation, and a, b are two 
distinct points from S, then the line segment between a and b contains an interval I with 
I OS. 

The next proposition will be used several times in the sequel; it clarifies the relation 
between finite unions of varieties and essentially convex relations. 

Proposition 3.3. Let W be a finite union of varieties Vi, . . . ,Vk C M", and let C QW be 

essentially convex. Then, there is an i < k such that C '^Vi. 

Proof. Let J C be minimal such that C Q [J^^jVi. If |J| = 1, then there is 

nothing to show. So suppose for contradiction that there are distinct i,j £ J. Then there 
must be points a,b G C such that a £ Vi and a ^ Vi for all / G J \ {i}, and b £ Vj and 



b ^ Vi for all / G J\{j}- By essential convexity of C and Corollary 3.2, the line segment L 



between a and b must contain an interval / that lies in C. Since J is finite, there must be 



/ G J such that infinitely many points on / are from Vi. By Lemma 2.7, all points on the 



line through a and b are from Vi ; this contradicts the choice of a and b. O 

The rest of the section is divided into two parts. We first prove that if 5 C is a 
semi- algebraic relation that is not essentially convex and this is witnessed by two rational 
points p and g, then CSP((riin, S)) is NP-hard. In the second part, we prove that if 5" C M'^ 
is a semi-linear relation that is not essentially convex, then this is witnessed by rational 
points and, consequently, CSP((riin, S")) is NP-hard. 



3.1. Semialgebraic relations and rational witnesses. We begin with the special case 
when 5 is a unary relation. The hardness proof is by a reduction from CSP(({0, 1}; -R1/3)) 
where 

i?i/3 = {(1,0,0), (0,1,0), (0,0,1)} . 
This NP-complete problem is also called Positive One-In-Three 3Sat jl6j L04], which 
is the variant of One-In-Three 3Sat where we have the extra requirement that in all 
input instances of the problem, no clause contains a negated literal. 

Lemma 3.4. Let S (^M be a unary relation. If S excludes an interval and this is witnessed 
by rational points p and q, then CSP((riiji, S)) is NP-hard. 
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Proof. We know that there are rational numbers < 6i < 82 < 1 such that p + {q — p)y S 
whenever 5i < y < 62- Let 

a = sup{(52 - (5i I < (5i < ^2 < 1 and [p+ {q-p)5i,p + {q - p)62] n S = 0}, 

i.e., the least upper bound on the length (scaled to the interval [0,1]) of excluded intervals 
between p and q. Choose rational numbers 61,62 such that 

• there exists y £ [61 — d, 61] such that p + {q — p)y G S; and 

• there exists y £ [62,62 + d] such that p + {q — p)y G S. 

• S n [p + {q - p)6i,p + (q - p)62] = f/}. 

where d = {62 — 6i)/5. It is easy to see that such 61,62 exist; we simply need to find 61, 62 
such that S r\\p + {q — p)6i,p+ {q — p)62] = and 62 — 61 is sufficiently close to a. Clearly, 
for any e > 0, there exist suitable 61, 62 such that a — {62 — 61) < e. 
Now, define p' = p + {q — p){6i — d) , q' = p + {q — p){62 + d), and 

U{y) = 3z.{z=p' + {q' -p')y A S{z) A < y < 1). 



Observe that U is pp-definable in riinU{5} by Lemma 2.11 combined by the fact that p' and 
q' are rational numbers. We claim that U contains at least one point in the interval [0,(i'], 
at least one point in the interval [1 — d', 1], and no points in the interval [d' , 1 — d'] where 
d' = 1/7. Let us consider the interval [0, d']. The point (expressed imp and q) corresponding 
to y = is (which equals p+ [q — p){6i — d)) while the point corresponding to y = 1/7 is 

p'+^^^=p+{q-p){6,-d) + ^-±^^^^^^^-^-^''-^^^^'-^^ 



p+{q-p){6i-d) + 

■p + {q-p){^i -d) + 
■p + {q-p){^i -d) + 



7 

{q-pm2 + d)-{6i-d)) 
7 

{q-p){62-6i+2d) 
7 

{q-p){5d + 2d) 



7 

= p+{q-p){6i-d) + {q- p)d 
= p+{q-p)6i 

We know that the choice of 61 and 62 implies that there is at least one point in S on 
the line segment between p+ {q — p)[6i — d) and p + [q — p)5i. The other two cases can be 
proved similarly. 

We show NP-hardness by a polynomial-time reduction from CSP(({0, 1}; Let (j) 

be an arbitrary instance of this problem and let V denote the set of variables appearing in 
(j). Construct a formula 

Tp=/\U{v)A /\ Vi+Vj+Vk>-A /\ Vi + Vj+Vk<Y- 



Lemma 2.11 implies that tj: is pp-definable in (M;{(x,y,z) \ x + y = z},{l},<,C/) (and, 
consequently, pp-definable in (M; {(x, y,z) \ x + y = z}, {1}, <, S)) and the formula can be 
constructed in polynomial time. We now verify that the formula ip has a solution if and 
only if (j) has a solution. 
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Assume that there exists a satisfying truth assignment /: V — )• {0,1} to the formula 
(j). We construct a solution g for ^ as follows: arbitrarily choose a point to £ [0, d'] such 
that to £ U and a point ti £ [1 — d' , 1] such that ti E U. Let g{v) = to if f{v) = 
and g{v) = ti, otherwise. Clearly, this assignment satisfies every literal of the type U{v). 
Each literal Vi + vj + Vk > 6/7 is satisfied, too, since g{vi) + g{vj) + (7(fA,.) = 2 ■ to + ti > 
2 • + (1 — d') = 1 — d' = 6/7. Similarly, each literal Vi + + Ufc < 11/7 is also satisfied: 
givi) + 5(7;^) + ^(wfc) = 2 • to + ti < 2 • d' + 1 = 9/7. 

Assume now instead that there exists a satisfying assignment g: V for the formula 
■0. Each variable obtains a value that is in either the interval [0, d'] or in the interval [1—d', 1]. 
If a variable is assigned a value in [0,(i'], then we consider this variable 'false', i.e., having 
the truth value 0; analogously, variables assigned values in [1 — d' , 1] are considered 'true'. 

We continue by looking at an arbitrary literal Ri/^{vi,Vj,Vk) in cp and its corresponding 
inequalities (1) Vi+vj+Vk > 6/7 and (2) Vi+vj+Vk < 11/7. If all three variables are assigned 
values within [0, d'], then their sum is at most 3d' = 3/7 which violates inequality (1). If two 
of the variables appear within [1 — d' , 1], then their sum is a least + 2(1 — d') = 12/7 which 
violates inequality (2); naturally, this inequality is violated if all three variables appear 
within [1 — d', 1], too. If exactly one variable appears within [1 — d' , 1], then the sum of the 
variables is at least 1 — d' = 6/7 and at most 1 + 2d' = 9/7. We see that both inequality 
(1) and (2) are satisfied. Hence, we can define a satisfying assignment f : V ^ {0, 1} for (p: 



fiv) 



if < g{v) < d' 

1 otherwise 



This concludes the proof. □ 



It is now straightforward to lift Lemma 3.4 to relations with arbitrary arities. 



Lemma 3.5. Let S O R'' be a semi- algebraic relation that is not essentially convex, and 
this is witnessed by two rational points p = {pi, . . . ,pk) and q = (qi, . . . , q^S). Let T be the 
structure (riin,^). Then, CSP(r) is NP-hard. 

Proof. Define 

k 

U{y) = 3z. I\zi=pi + {qi - pi)y A S{z) A < y < 1 
1=1 



where z = {zi, . . . ,Zk)- By Corollary 3.2, U excludes an interval and CSP(r) is NP-hard 



by Lemma |3.4| since U is pp-definable in P. □ 

Remark 3.6. If S* is not essentially convex and this is witnessed by non-rational points 
only, then the problem CSP(r) for P = (M; {(x, y, z) \ x + y = z}, {1}, <, S) might still be 
solvable in polynomial time. Consider for instance the binary relation 

S = {{x,y)\ {\x + y\<l)A{y = V2x^\x + y\ = 1)} . 

Clearly, S is not essentially convex; however, the only witnesses are {^/2 — 1, 2 — \/2) and 
(-^2 + 1,-2 + ^/2) (see Figure [l|). 

We show that CSP(P) can be solved in polynomial time. To see this, define 

S' = {{x,y) I {\x + y\ < 1) A(x/OVy/0)} 

and A = (M; {(x, y, z) \ x + y = z}, {1}, <, S"}. We first prove that a primitive positive 
sentence is true in P if and only if it is true in A. Clearly, if a primitive positive sentence is 
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Figure 1: Illustration of relation S 



true in F, then it is also true in A, since the relations in A are supersets of the corresponding 
relations in T. Conversely, suppose that <I> is primitive positive and true in A. Let a be 
an assignment of the variables of $ that satisfies all conjuncts in <I>. Since A is semi-linear, 
we can assume that a is rational (see Lemma 3.7). 
in r and in A are the relations S and S' . 



The only relations that are different 
Since S' \ S contains irrational points only, the 



assignment a shows that $ is also true in A. Finally, CSP(A) can be solved in polynomial 
time by the results in Section 5.1 (note that the constraint |x + y| < 1 is equivalent to a 



conjunction of four linear inequalities). 



3.2. Semilinear relations. In the previous section, we showed that there exists a semi- 
algebraic relation S that is not essentially convex, but CSP((M; {{x, y, z) \ x + y = z}, {1}, < 
, S)) is polynomial-time solvable. If we restrict ourselves to semi- linear relations S, then this 
phenomenon cannot occur: indeed, in this section we prove that if a semi-linear relation 



S is not essentially convex, then this is witnessed by rational points (Lemma 3.9), and 
CSP((M; {(x, y,z)\x + y = z}, {!}, <, S)) is NP-hard. 

Lemma 3.7. Every non-empty semi-linear relation S contains at least one rational point. 

Proof. Assume first that S" is a non-empty unary relation such that S" Pi Q = 0. If S" 
contains infinitely many points, then it also contains an interval due to o-minimality of S] 
this contradicts that 5 n Q = 0. So we assume that S contains a finite number of points. 
Consider the unary relation S' = {min(5)} and note that it can be pp-defined in iV\\^, S) by 
S'{x) = S{x) Ax < p where p denotes a suitably chosen rational number. By Theorem 2.10| 



S' has a quantifier-free definition (p over (M; +,—,<) with parameters from Q, and we can 
without loss of generality assume that (j) is in disjunctive normal form, and contains a 
single disjunct since \S'\ = 1. Assume without loss of generality that every conjunct of this 
disjunct of (p is of one of the following forms: x > c, x < c, oi x ^ c (where c denotes some 
rational number). Let a = max{c \ {x > c) € cp} and b = min{c \ {x < c) £ <j)}. li a = b 
then S' = {a} and we have a contradiction since a is a rational number. If a < 6, then S' 
contains an infinite number of points (regardless of the number of disequality constraints 
in (/>) and we have a contradiction once again. 

Assume now that ar{S) = d > 1. Arbitrarily choose a point s = {si, . . . , Sd) G S 
with a maximum number of rational components. Assume without loss of generality that 
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Si, . . . , Sk' , k' < k are rational points and consider the unary relation 

U{xk) = 3x1, ... , Xk-i.{S{xi, ...,Xk) A xi = si A • • • A Xfc' = Sk'). 
We get a contradiction since C/ n Q = 0, [/ is non-empty, and U is semi-linear. O 

Corollary 3.8. Let S ^ R'' be a semi-linear relation and let s G S be arbitrary. Then, 
every open k- dimensional ball B around s of radius e > contains a rational point in S. 

Proof. If there is an e such that B does not contain any rational point in 5, then there is 
a linear set P within B such that S P only contains irrational points. This contradicts 
Lemma |3.7| □ 

A hyperplane is a set V = {x ^ R^ \ p{x) = 0} where p is a linear term such that 
C y C M'^ (this makes sure that the degree of p is one). We do not require that the 
coefficients in p are rational; it is important to note that this differs from the definition of 
a linear set. If all coefficients appearing in p are rational, then we say that the hyperplane 
is rational. 

Lemma 3.9. If T is a semi-linear relation that is not essentially convex, then this is 
witnessed by rational points, and CSP((r;j„,, T)) is NP-hard. 

Proof. If there are rational witnesses of the fact that T is not essentially convex, then 
NP-hardness follows from Lemma 13.51 and we are done. 

Assume now that there exists a relation T that is not essentially convex but T lacks 
rational witnesses. Arbitrarily choose such a T with minimal arity k. We first consider the 
case when k = 1. Arbitrarily choose witnesses p,q & T. By o- minimality, there are finitely 
many intervals Ii, . . . ,Im with endpoints in M U {±00} and a finite set Dq C M such that 
T = DqU UilLi ^i- Now, apply the following process to Dq and /i, . . . , Im- 

• if there is a point d G Dq and an interval Ij, 1 < j < m, such that d is in Ij, then remove 
d from Dq and replace Ij with Ij U {d}; 

• repeat until Dq is not changed. 

After these modifications, the sets Ii, . . . , are still (open, half-open, or closed) inter- 
vals, and for every point d S Dq, there exists an > such that [d — ed,d-\- ed]riT = {d}. 

Assume without loss of generality that p Q. If p G Dq, then choose rational numbers 
p^ ,p^ such that p — ep < p^ < p < p^ < p + ep; this is always possible since the rationals 
are a dense subset of the reals. Consider the semi-linear relation 

r'(x) = T{x) f\p~ < X < p^ 
and note T' = {p}. However, p is not a rational number which contradicts Lemma |3.7[ 
We may thus assume that p Dq and that p is a member of an interval / G {Ii, . . . , Im}- 
Arbitrarily choose one rational point p' £ I; once again, this is possible since the rationals 
are a dense subset of the reals. Note that p',q witness that T is not essentially convex. If 
q £ Q, then we are done so we assume that q ^ Q. We see that q Dq by reasoning as 
above. Consequently, (7 is a member of an interval J £ {Ii, . . . ,Im} and I ^ J. Finally 
choose a rational point q' £ J and note that p',q' are rational points witnessing that T is 
not essentially convex. 

Assume instead that k > 1. Let Sk denote the set of relations S that satisfy 1,2, and 

3: 

(1) S is a semi- linear relation of arity k. 



ESSENTIAL CONVEXITY AND COMPLEXITY OF SEMI-ALGEBRAIC CONSTRAINTS 



13 



(2) S is not essentially convex, and 

(3) for every pair of witnesses that S is not essentially convex, at least one is irrational. 
We now conclude the proof by considering two different cases. 

Case 1: There exists an 5 € Sk and a finite set of rational hyperplanes Hi, . . . , Hh such 
that S C |Jj=i-^i- Choose the hyperplanes such that h is minimal. Let VjW £ S he 
arbitrarily chosen witnesses for the fact that S excludes an interval, and let / denote this 
interval. 

Suppose first that h = 1, i.e., that there is a single hyperplane H such that S H. 
Obviously, x = (xi, . . . ,Xk) £ H <^ c\X\ + • • • + CfcXfc = cq for some rational constants 
Co, . . . , Cfc. We assume without loss of generality that at least one say c/j, is non-zero. 
Define the relation S' by 



Ql, (Qi N. Co - CiXi - . . . - CA;_iXfc_i 

6 (xi,...,Xfc_i) = 3y.(6(xi,...,Xfc_i,y) Ay = ) . 

Let v' = (vi, . . . , Vk-i) and w' = {wi, . . . , Wk-i), and note that v' , w' are witnesses of an 
excluded interval in 5'. If S' lacks rational witnesses of essential non-convexity, then the 
fact that S' has arity k — 1 contradicts the choice of T. Hence, S' has two rational witnesses 
t = {ti, . . . ,tk-i) and u = {ui, . . . ,Uk-i)- This implies that 

Co — Cjti — ... — Ck-ltk- 



t' = ( ti, . . . , tk- 

and 



, / Cq - CiUi - ... - Ck-lUk-l 

U = Ul, . . ..Uk-l-, 

V Cfc 

are rational witnesses for S*, which leads to a contradiction. 



Next, suppose that h > 2. Ijel H[ = S r\ {Hi \ [j'j=2^j) and H!^ = S D {H2 \ 
Uje{i 3 h}^j)- minimal choice of h, H[ and H2 are non-empty. Furthermore, 



they are semi-linear so we can choose rational points Pi G H'-, 1 < i < 2, hy Lemma 3.7 



We now claim that at most a finite number of points on the line segment between pi and 
P2 lie in S. Suppose to the contrary that infinitely many points lie on the line segment. 
Then, there must be one Hi, i > 1, such that infinitely many points from Hi lie on the line 
segment. Hence, Hi (since it is a variety) must contain the entire line by Lemma 2.7 This 
leads to a contradiction since pi and p2 are chosen so that |{pi,P2} H Hj\ < 1, 1 < j < h. 
Thus, we have found rational witnesses for essential non-convexity of S and obtained a 
contradiction since S £ Sk- 

Case 2: There is no /S € Sk such that there exists a finite set of rational hyperplanes 
Hi, ... , Hh and S C IJj=i ^j- Arbitrarily choose S £ Sk, let v,w £ S he arbitrarily chosen 
witnesses for the fact that S excludes an interval, and let / denote such an interval. 

If there exists a rational hyperplane H such that {v, w} S D H, then the semi-linear 
relation 

S'{xi, ...,Xk) = S{xi, ...,Xk) A H{xi, ...,Xk) 
excludes an interval and this is witnessed by v and w. Obviously, S' S Sk and S' C H. 
This contradicts the assumptions for this case so we assume that {v,w} (and consequently 
/) do not lie on any rational hyperplane. 
Next, we prove a couple of facts. 
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Fact 1: I S \ S. We show that there is no point e £ I and an e > such that the open 
fc-dimensional baU B around e with radius e satisfies B D S = 0. Assume to the contrary 



that there is a point e S I satisfying this condition. By Corohary 3.8 there exist rational 
points in S arbitrary close to v and w. Thus, one can find rational points v',w' E S such 
that the line segment L between v' and w' passes through B and L' = LCi B has non-zero 
length. In other words, v' and w' are rational witnesses of an excluded interval and we have 
obtained a contradiction. 

Fact 2: There exists a finite set {-ffi, . . . ^iih\ of rational hyperplanes such that 5 \ 5 C 
U^^^ Hi. Let (/) be a first-order definition of S and let -0 = -Ci V • • • V Z)„ be a quantifier- free 



definition of S in disjunctive normal form; such a ^ exists due to Theorem 2.10 Note 
that every parameter appearing in is rational: initially, every parameter in (/> is rational, 
the quantifier elimination does not add any irrational parameters, and the conversion to 
disjunctive normal form does not introduce any new parameters. Let li, . . . ,1^ denote the 
literals appearing in cj). For each li = p{xi, . . . , x^) r (where r G {<, <, =, ^, >, >}), create 
a hyperplane Hi = {(xi, . . . , x^) C M'^ | p[xi, . . . ,Xk) = 0}. In other words, we let the 
boundary of the subspace defined by k define a hyperplane Hi. It is now easy to see that 
S \ S CI dS C U™ Hi. Furthermore, every hyperplane Hi, ... , Hm is rational. 

We are now ready to prove the second case of the proof. By Fact 1, I C 5 \ 5. The set 
5 \ 5 is a subset of (jf^^ Hi where Hi, ... , H^ are rational hyperplanes by Fact 2. Hence, 



/ is a subset of some Hi by Proposition 3.3 a contradiction. □ 



4. Essentially Convex Relations 

Before we present a logical characterization of essentially convex semi-algebraic relations, 
we give examples that show that two more naive syntactic restrictions of first-order formulas 
are not powerful enough for defining all essentially convex semi- algebraic relations. Both 
of those restrictions are motivated by classes of essential convex semi-linear relations that 



have appeared in the literature, cf. 21 . When S" is a subset of M", we write -^S for the 
complement of S, i.e., for M" \ S. 

We start with an example that shows that not every essentially convex semi- algebraic 
relation can be defined by conjunctions of first-order formulas of the form 

Pi /OV--- Vpfc /0V(/. 

where pi, . . . ,pk are polynomials with coefficients from M, and where 4> defines a convex set. 
It is easy to see that every relation that can be defined by such a conjunction is essentially 
convex. 

See Figure § left side. The fi gure shows a 1-dimensional variety C C M^, given as 
{(p(t), q{t)) I t S M} for polynomials p and q. The figure also shows two marked segments 
5i, S2 on the curve C. The marked segments are chosen such that one end point is contained 
in interior of the convex hull of the other three end points of the segments. 

Let S be the set -iC U 5i U ^2. Clearly, S is essentially convex. Now, suppose for 
contradiction that 5 has a definition ■0 as described above. Let H be the convex hull of 
5*1 U /S'2. The crucial observation is that the set G := {H n C) \ (^i U ^2) is infinite. Since no 
point from G is in S, there must be a conjunct pi7^OV---Vpfc7^OV</>in0 that excludes 
infinitely many points from G. In particular, the variety V defined hy pi = ■ ■ ■ = pk = 
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contains infinitely many points from C. As in the proof of Lemma 2.7 one can see that V 
must contain C. Hence, all points in 5i U S2 must satisfy 0; but in this case, all points in 
G satisfy pi 7^ V • • • V 7^ V 0, a contradiction. 




Figure 2: Examples of essentially convex relations. 

Our first example might motivate the following notion of definability: we consider 
conjunctions of formulas of the form pi ^ OV---Vpfc 7^ OV0 such that for every conjunction 
of linear equalities ip that implies pi = for all i < k, the set defined by (/) A ■0 is convex. 
The set described above can indeed be defined by such a formula. Similarly as before, it is 
also easy to see that all relations that can be defined in such a way are essentially convex. 
However, we again have an example of a semi- algebraic essentially convex relation that 
cannot be defined by such a conjunction. 

See Figure [2| right side. The figure shows the boundary i? of a doubly infinite cone 
with apex o. On the boundary, there is a straight line segment L through o, and a circle C 
that cuts L. Let S be the set U C U (L \ {o}). It can be verified that S is essentially 
convex. However, we claim that there is no conjunction as described above that defines 
S. The reason is that when pi 7^ V • • • V pfc 7^ V is such that pi = A • • • A = 
describes and if (p contains C U (L \ {o}), then it must also contain o in order that 
Pi7^0V---Vpfc/0V(/) meets the required condition. However, the set -^B U C U L is not 
essentially convex since o and points from the cycle exclude an interval. 

The correct definition of formulas that correspond to essentially convex sets has to take 
these examples into account. We call these formulas convex Horn formulas. Basically, a 
convex Horn formula is a conjunction of implications pi = • • • = pfc = — t- (/» such that the 
premise defines a variety V , and the formula is again convex Horn when restricted to any 
convex subset of V . Formally, we have the following definition. 

Definition 4.1. The set of convex Horn formulas is the smallest set of first-order formulas 
such that 

• all formulas defining convex closed semi-algebraic relations over (M; +, *, <) with param- 
eters from M are convex Horn; 
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• Suppose that pi,.-.,Pk are polynomials, (p is a first-order formula that defines a set 
U C M", and for every semi- algebraic convex set C contained in the set defined by 
Pi = ■ ■ ■ = Pk = 0, the set C nU can be defined by a convex Horn formula, and has 
strictly smaller dimension than the set defined hy tp = {pi ^ ■ ■ ■ V pk ^ 0\/ (j)). Then 
ip is also convex Horn. 

• Finite conjunctions of convex Horn formulas are convex Horn. 

For example, the formula (x^ — y 7^ 0) A (y > 1), describing the half-plane above y = 1 
with exception of the standard parabola, is convex Horn. Every convex set C contained in 
the set defined by — y = consists of at most one point, and hence is 0-dimensional and 
can be defined by a convex Horn formula. 

We can prove properties about the set of all convex Horn formulas by induction over 
the level of a convex Horn formula, which is defined as follows. The level of a formula that 
defines a convex closed semi-algebraic relation is 0. Now, suppose we have already defined 
convex Horn formulas of level smaller than i, and let ip he a. convex Horn formula that does 
not have level smaller than i. Then ^p has level i if it is the finite conjunction of formulas 
V'' ^ (pi 7^ V • • • V pfc 7^ V 0) such that for every semi-algebraic convex set C contained 
in the set defined hy pi = ■ ■ ■ = pk = 0, the intersection of C with the set defined by cp is 
convex Horn, has level at most i — 1, and strictly smaller dimension than the set defined by 
V''. Since the intersection of sets of dimension n has at most dimension n, it follows directly 
from the definition of convex Horn formulas that the level of a convex Horn formula p is 
bounded by the dimension of the set defined by (p. 

Looking back at the formula (a;^ — y 7^ 0) V (y > 1), we claim that it is convex Horn of 
level one: every convex set contained in the parabola consists of at most one point, and can 
hence be described by a convex Horn formula of level zero. For another example, consider 

- 7/ / 0) V (z > 0), i.e., the same parabola in three dimensions on one side side of the 
x-y-plane. Each convex subset of x'^ — y = is a point, a straight line, or a line segment in 
the z direction and can again be described by a level zero convex Horn formula. 

We are now ready to logically define essentially convex semi-algebraic sets via convex 



Horn formulas. This is done in two steps; we first prove (in Proposition 4.2) that every set 
defined by a semi-algebraic convex Horn formula is essentially convex. The rest of the section 
is devoted to proving the other direction — the final result can be found in Theorem |4.6[ 

Proposition 4.2. Any set S defined by a convex Horn formula ip over (M; *,-!-,<) is 
essentially convex. 

Proof. Our proof is by induction over the level of ip. Let m denote the number of free 
variables in ip. If the level of ip is 0, then S" = {x G M™" | ip{x)} is a closed convex set and, 
in particular, essentially convex. 

Assume that all relations defined by convex Horn formulas of level < i are essentially 
convex. Arbitrarily choose a convex Horn formula tp = pi 7^ OV---Vpfc 7^ OV0 with 
level i. Define S = {x € M"" | ij{x)}, ^ = {x G M"' | pi{x) = ■■■ = pk{x) = 0}, and 
[/ = {x S I (p{x)}. Since ip is level-i convex Horn, we know that for every semi- algebraic 
convex set C such that C C y, the set C DU can be defined by a convex Horn formula of 
level smaller than i and dim(C DU) < dim(5). 

Suppose for contradiction that there are a,b G S and an infinite set / of points on the 
line segment L between a and b that is not contained in S. In particular, / C 1/ since 



S = —iV U U. By Lemma 2.7, all points on the line through a and b are in V. However, a 



and b are in S and therefore in U so L n f/ is not essentially convex. We now note that L is 
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a semi- algebraic convex set that is a subset ofVsoLDU can be defined by a convex Horn 
formula of level smaller than i. Consequently, LOU is essentially convex by the inductive 
assumption which leads to a contradiction. 

Finally, suppose that is a finite conjunction of convex Horn formulas of level at most 
i. Since the intersection of finitely many essentially convex relations is essentially convex, 
we are done also in this case. D 

Next, we need some preparations for the proof of the converse implication (Theo- 
rem |4.6|) : we show that semi-algebraic relations can be defined by a special type of for- 



(Lemma 4.5) 



mula (Lemma 4.4), and that the closure S of an essentially convex relation S is convex 



Definition 4.3. Let S" be a semi-algebraic relation. We say that a first-order formula (j) is 

a standard defi of S if 

• 4'{xi, . . . , Xfc) defines C M*' over (M; *,+,—,<) with parameters from M; 

• (/> is in quantifier-free conjunctive normal form; 

• if we remove any literal from (p, then the resulting formula is not equivalent to (p; and 

• all literals are of the form t < or t ^ 0. 

Lemma 4.4. Every semi-algebraic relation S has a standard definition. If S is even semi- 
linear, then it has a standard definition cf) that does not involve the function symbol for 
multiplication and irrational parameters. 



Proof. By Theorem 2.8 , we know that S has a quantifier-free definition over (M; *,+,<) with 
parameters from M, and it is clear that such a definition can be rewritten in conjunctive 
normal form (j). Replace a clause a in (f) with a literal of the form a < b hy two clauses ai 
and Q2 obtained from a by replacing a < b hy a < b and by a 7^ 6, respectively. In the 
same way we can eliminate occurrences of a = 6 from (j) using < . Literals of the form a <b 
(a 7^ b) can then be replaced by a — 5 < (and a — b ^ 0, respectively). Finally, we remove 
literals from (f> as long as the resulting formula is equivalent to the original formula. 



If S is semi-linear, then by Theorem 2.10 we know that S has a quantifier- free definition 



over (M; +, — , <) with parameters from Q, and it is then clear that the formula constructed 
from (p as above will be a standard definition of S without the function symbol for multi- 
plication and irrational parameters. □ 

Lemma 4.5. The closure S of an essentially convex relation S is convex. 

Proof. Let a,b £ S. We will show that all points c on the line segment between a and 6 are 
in S. We have to show that for every e > there is a point c' in S such that the distance 
between c and c' is smaller than e. Since a G S and b G S, there are points a' £ S and 
b' £ S that are closer than e/2 to a and b, respectively. Let L be the line from a' to b' . It 
is clear that there are infinitely many points on L that are at distance less than e from c. 
Hence, since a' and b' are in S and S is essentially convex, there must be one such point in 
S, and we are done. □ 

Theorem 4.6. A semi- algebraic relation S C M" is essentially convex if and only if it has 
a convex Horn definition. Moreover, when S is even semi-linear then S has a semi-linear 
convex Horn definition. 



Proof. We have already seen in Proposition 4.2 that every relation defined by a convex Horn 



formula is essentially convex. We now show the more difficult implication of the statement. 
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Let (/) be a standard definition of S. The proof is by induction on the dimension d of S. For 
d = 0, the set S consists of at most one point, and the statement is trival. Otherwise, if 
15"! > 2, then by essential convexity, Corollary 3.2, and Lemma 2.6 we have that dim(S') > 1. 

For d > 0, we will construct two formulas 4'i,4>2 such that is equivalent to 0i A (/)2. 
Thereafter, we will show that (pi is equivalent to a conjunction of convex Horn formulas, 
and that 02 defines a closed convex relation (and consequently is convex Horn). Since finite 
conjunctions of convex Horn formulas are also convex Horn, (j) is then equivalent to a convex 
Horn formula. 

We begin by writing all clauses of as a — )• /3 where a is either 'true' or a conjunction 
of polynomial equalities and /3 is either 'false' or a disjunction of inequalities. This is always 
possible since a clause 

(pi < V • • • V pfe < V 51 / V • • • V / 0) 

is logically equivalent to 

(gi = A • • • A = 0) ^ (pi < V • • • V pfc < 0). 

Next, we rewrite all clauses a — )■ /3 where a is not equal to 'true', as (a — )• (/3 A 0)). Let 0i 
be the conjunction of all the implications of the type (a — )• {(3 A(f))) and (j)2 the conjunction 
of the remaining implications, i.e., those of the type (true — >• (3). The formula (pi A (j)2 is 
clearly equivalent to the formula (j). 

We begin by studying the formula (pi. Let a — t- (/? A 0) be a clause from (pi, let V be 
the variety defined by a, and let U be the set defined by /3 A 0. Observe that C/ C 5. We 
now show that the intersection of the set U with a semi- algebraic convex set C can be 
defined by a convex Horn formula. We make two claims about the set [/ n C: 

Claim 1. [/ n C is essentially convex. For arbitrary points a,b £ U DC, let Lab denote the 
line segment from a to b, and Xab = {x £ Lab \ x ^ U CiC}. Suppose for contradiction that 
there exist a,b £ U C such that Xab is an infinite set. Since a,b £ C and C is convex. 
Lab ^ C which implies that Xab = {x G Lab \ x ^ U}. Moreover, C C V so Xab ^ V- Now 
recall that a,b £ S since U CiC C. U S: thus, there are infinitely many points (those that 
are in Xab) between a,b £ S that are in V but not in U. This shows that no point in Xab 
satisfies a — )■ {(3 Ap), and Xab H 5 = 0. This fact contradicts the essential convexity of S. 

Claim 2. U Ci C has smaller dimension than S. Let T be the set S \ {U n C). It suffices 
to show that f7 n C is a subset of T \ T, because Lemma 2.6 asserts that dim(T' \ T) < 
dim(T) < dim(5). 

The set S must contain a point p that is not in V, because if 5" C y then we could 
replace the clause of (p that was re-written to a — )• (/? A (^) by /3 and obtain a formula that 
is equivalent to p; this contradicts the assumption that (p is a. standard definition of S. 

To show that {U DC) ^ T \ T, let x be an arbitrary point in U DC. Only finitely many 
points on the line segment between p and x can be from (U Ci C) C V, because otherwise 
Proposition |3.3| implies that V must contain the entire line between x and p, including p, 
a contradiction. Also the set S contains all but finitely many points on the line segment 
between p and x: this is by essential convexity of S, since x £ U Ci C S and p £ S. 
Hence, we can choose a sequence of points from T = S \ {U DC) on this line segment that 
approaches x, which shows that x £T. 
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Since U DC is semi-algebraic, essentially convex, and has smaller dimension than S, it 
follows by the inductive assumption that it can be defined by a convex Horn formula. Thus, 
01 is equivalent to a finite conjunction of convex Horn formulas. 

We claim that (j)2 defines a closed convex set D. This follows from Lemma 4.5, since D 
is in fact the closure of S. To see this, observe that D is clearly a closed set, D contains 
S, and hence S D = D. To prove that D <^ S, let y be from D \ S. Consider the 
clauses ai — )• A </)),..., a/ — )• {/3i A (p) of (pi, and let V^, for 1 < i < be the variety 
{x G M'^ I x satisfies Qj}. There must be a point q in S that is not contained in the set 



W = IJi<z otherwise. Proposition 3.3 implies that there exists an i < / such that S" C V^. 
In other words, all points in S satisfy a^. This is in contradiction to the assumption that cp is 
a standard definition of S, since in this case the formula (p is equivalent to the formula where 
the clause of (p that has been rewritten to — )• {f3iA(p) is replaced by Pi. Only finitely many 



points on the line segment L between q and y can be from W, because otherwise Lemma 2.7 



implies that W contain the entire line between y and q, including q, a contradiction. Hence, 

yes. 

Finally, consider the case that S is semi-linear. By Lemma 4.4 we can choose (p to be 
a standard definition which is semi-linear (and only uses parameters in Q). Then the proof 
above leads to a semi-linear convex Horn definition of 5. □ 



5. Applications 



5.1. Semi-linear constraint languages. We will now show that a finite semi-linear ex- 
pansion r of Fiin has a polynomial-time tractable constraint satisfaction problem if and only 
if all relations of T are essentially convex (unless P = NP). Recall that a relation is semi- 
linear if it has a first-order definition in (M;+,l,<). A quantifier- free first-order formula 
in CNF is called Horn-DLR [2l] (where 'DLR' stands for disjunctive linear relations) if its 
clauses are of the form 

Pi / V • • • V pfc / 

or of the form 

pi/OV---Vpfc/OVpo<0 
where Po,Pi, ■ ■ ■ ,Pk are linear terms with rational coefficients. A semi- linear relation is 
called Horn-DLR if it can be defined by a Horn-DLR formula. 



Theorem 5.1 (see (TO, 21 , 22]). Let T be a structure with domain M whose relations are 



Horn-DLR. Then CSP(r) is in P. 

In this section, we show the following. 

Theorem 5.2. Let T = {T\[^, Si, . . . ,Si) be a constraint language such that Si, . . . ,Si are 
semi-linear relations. Then, either each relation Si, . . . ,Si is Horn-DLR and CSP(r) is in 
P, or CSP(r) is NP-complete. 



In order to prove Theorem |5.2[ we need to characterize convex and essentially convex 
semi-linear relations. This is done in Lemma |5.3| and Theorem |5.4[ respectively. 

Let Pi, . . . , P„ be (possibly unbounded) polyhedra defined such that Pj = {x G M'^ | AiX < 
bi}. Bemporad et al. [3] define the envelope of Pi, . . . , Pn {env{Pi, . . . , P„)) as the polyhe- 
dron 
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{x G M'^ I A'^X < . . . , A'^X < b'J 

where A'^x < h\ is the subsystem of AiX < hi obtained by removing ah the inequahties not 
vaHd for the other polyhedrons Pi, ... , Pj-i, Pi+i, Pn- We note that if Pi, . . . , are defined 
with coefficients from Q, then env{Pi, . . . , Pn) can be described by rational coefficients, too. 
By combining Theorem 3 with Remark 1 in [3|, it follows that UiLi convex if and only 
if [Jti Pi = env{Pi,...,Pn). 



Lemma 5.3. A closed semi-linear relation S is convex if and only if it has a primitive 
positive definition in (M; +, 1, <). 

Proof. It is straightforward to verify that relations with a primitive positive definition in 
(M; +, 1, <) are convex; each relation defines a convex set and the intersection of convex 
sets is convex itself. 

For the converse, let (/> = V'l V • • • V ipm be a quantifier-free definition of S over the 
structure (M; +, — , <) with parameters from Q, written in disjunctive normal form. If there 
is a disjunct tpi that contains a literal p q, for linear terms p and q, then split the disjunct 
into two; one containing p — q < and one containing p — q > 0. By repeating this process, 
every literal p ^ q can be removed. Similarly, every literal p = q can be replaced by 
p — g<0 A q — p<^. Thus, we may assume that every literal appearing in the tpi is of the 
type p < or p < 0, for a linear term p. Let Di, . . . , D^i be the sets defined by ■01, • • • , V'm, 
respectively. 

Now recall that the topological closure operator preserves finite unions, i.e., Z)i U • • • U 
Djn = Z)i U • • • U Dm- Hence, 



S = Di[J ■ ■ - [J Dm<^DiU ■ ■ - [J Dm = Di\J ■ ■ - U Dm = S = S 

and 5 = Z)i U • • • U Dm- We now note that if P = {x G M'^ | Ax < 6, Cx < d} and P / 0, 
then P = {x G I Ax < b, Cx < d}, cf. Case (i) of Proposition 1.1 in 17 . Thus, each Z)j 
equals {x G M'^ | AiX < 6j} for some rational Ai,bi- Furthermore, S = IJI^i convex 
so = env{Di, . . . , Dm)- This implies that 5 = {x G M'^ | Cix < di, . . . , CmX < dm} 
for some rational matrices Ci, . . . , Cm and rational vectors di, . . . , dm- It is easy to see 
that each CjX < di is pp-definable in (M; +, 1, <) by the same technique as in the proof of 



Lemma 2.11, and this concludes the proof. D 



Theorem 5.4. A semi-linear relation S is essentially convex if and only if S is Horn-DLR. 

Proof. We first prove that every Horn-DLR relation S is essentially convex. Let i;^ be a 
Horn-DLR definition of S. Suppose for contradiction that there are a,b € S and an infinite 
set / of points on the line segment L between a and b is not contained in S. Since (j) has 
finitely many conjuncts, there is a conjunct ip in (f> that is false for an infinite subset /' of /. 
If ip is of the form pi 7^ V • • • V ^ 0, then all points in /' satisfy pi = ■ ■ ■ = pk = 0. By 



Lemma 2.7 the entire line L satisfies pi = ■ ■ ■ = pk = 0. This contradicts the assumption 



that a ^ L and b ^ L satisfy (j). If ip is of the form pi 7^ V • • • V 7^ V po ^ 0, then all 



points in /' satisfy pi = ■ ■ ■ = Pk = and pQ > 0. Again by Lemma 2.7 we find that both 
a and b must satisfy pi = ■ ■ ■ = Pk = 0. Since a, b also satisfy ip, we conclude that both 
points satisfy po < 0. But then also all points in L must satisfy po < 0, which contradicts 
the fact that the points in /' satisfy po > 0. 

The other direction of the statement can be derived from Theorem 14.61 as follows. Let 



S be an essentially convex semi-linear relation. By Theorem 4.6, S has a semi-linear convex 
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Horn definition We prove by induction on the level of ip that ip is Horn-DLR. If the level 
is 0, then S is closed and convex and the claim follows from Lemma 5.3 Now suppose that 
^p has level i > and is of the formpi / OV- • -Vpfc / OVV'', where pi = ■ ■ ■ = Pk = defines 
a set V, and tp' defines a set U such that for every semi-algebraic convex set C C y the set 
C nU has a convex Horn definition of level strictly smaller than i. Since ip is semi-linear, 
the terms Pi, ■ ■ ■ ,Pk are linear. Hence, V is convex, and by taking C := V in the statement 
above we see that V CiU has a convex Horn definition of level strictly smaller than i. By 
the inductive assumption, V CiU has a definition by a Horn-DLR formula (p with clauses 
01, ... , (pm- Then cp' = Ai<i<m(Pi / V • • • V pfc 7^ V is clearly Horn-DLR. We claim 
that (p' defines S. First suppose that a £ ^V. In this case, a clearly satisfies cp' and this is 
justified by the fact C S. Suppose instead that a £ V. Then a satisfies cp' if and only 
if it satisfies (p, and since (p defines V DU this is the case if and only if a satisfies ^p'. 

Finally, the statement holds if ip is the conjunction of finitely many convex Horn for- 
mulas (which are Horn-DLR by inductive assumption). O 

Proof of Theorem \5.S\ If all relations of F are Horn-DLR, then CSP(F) can be solved in 
polynomial time (Theorem 5.1). Otherwise, if there is a relation S from F that is not Horn- 
DLR, then Theorem 5.4 shows that S is not essentially convex, and NP-hardness of CSP(F) 



follows by Lemma 3.9 



So we only have to show that CSP(F) is in NP. Let $ be an arbitrary instance of 
CSP(F). By Theorem 2.10 every relation of F has a quantifier-free definition in conjunctive 
normal form over (M; +,—,<) with rational parameters. One can now non-deterministically 
guess one literal from each clause of in the defining formula for each constraint and verify 
- in polynomial-time by Theorem 5.1 - that all the selected literals are simultaneously 
satisfiable. □ 



5.2. Generalized linear programming. In this section, we study generalizations of the 
following problem. 

Linear Programming (LP) 

INPUT: A finite set of variables V, a vector c £ Q'^', a number M G Q, and a finite 
set of linear inequalities of the form aixi + • • • + OnXn < oq where xi, . . . , Xn G V and 
ao,...,an £ Q- All rationals are given by numerators and denominators represented in 
binary. 

QUESTION: Is there a vector x G M'^' that satisfies the inequalities and c^x > Ml 

We generalize LP as follows. Let F be a structure (Fun, . . . , Rm) such that . . . , Rrr, 
are semi-linear relations. 

Generalized Linear Programming for F (GLP(F)) 

INPUT: A finite set of variables V ^ a vector c G Q'^', a number M G Q, and a finite set $ 
of expressions of the form i?(xi, . . . , x^) where i? is a relation from F and xi, . . . , x^ G V . 
QUESTION: Is there a vector x G M'^' that satisfies the constraints and c^x > Ml 



This can indeed be viewed as a generalization of LP because of Proposition 2.12 the 
problem LP is polynomial-time equivalent to the problem GLP(Fiin). 
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Theorem 5.5. LetT = {M;Tii^, Ri, . . . , Ri) be a structure with semi-linear relations Ri, Ri. 
Then, either each Ri is Horn-DLR and GLP(r) is in P, or GLP(r) is NP-hard. 

Proof. If there is an Ri that is not Horn-DLR, then the relation Ri is not essentially convex 



by Theorem |5.4| and CSP((riin, is NP-hard by Theorem |5.2[ Clearly, GLP(r) is NP- 
hard, too. 

Assume instead that each Ri is Horn-DLR. We present an algorithm that actually solves 
a more general problem that includes GLP(r). Let $ be an arbitrary satisfiable Horn-DLR 
formulsj^ over variable vector x = (xi, . . . , Xn) and let c be a rational n- vector. 

We assume additionally that $ AZ? is satisfiable for every disequality literal (i.e., literal 
p{x) ^ a) D appearing in If $ A -D is not satisfiable, then every occurrence of -D in $ 
can be removed without changing the set defined by the formula. Furthermore, this check 
can be carried out in polynomial time by Theorem |5.1[ Hence, we may assume that $ 
has this additional property (which we will refer to as (*)) without loss of generality. Let 
<I> = A <I>" where consists of the clauses not containing any disequality literal p[x) ^ a, 
and consists of the remaining clauses. 

Given our algorithm returns one of the following three answers: 

• 'unbounded': for every i^T S Q, there exists a solution y such that c^y > K; 

• 'optimum: K^: there exists a K ^ Q and a solution y such that c^y = K, but there is no 
solution y' such that c^y' > K; 

• 'optimum is arbitrarily close to K^: there exists a K £ Q such that there is no solution 
y satisfying c^y > K, but for every e > there is a solution y' with c'^y' > K — e. 

We claim that the following algorithm solves the task described above in polynomial time. 

Step 1. Maximize c^x over (by using some polynomial-time algorithm for linear pro- 
gramming). Let K denote the optimum. If K = oo, then return 'unbounded' and stop. 

Step 2. Check whether <I> A c'^x = K is satisfiable. Note that c^x = K has a primi- 
tive positive definition in Pun, which furthermore can be computed in polynomial time by 



Lemma 2.11 Therefore this check can be reduced to deciding satisfiability of Horn-DLRs. 
If <I> A (rx = K \s satisfiable, then return 'optimum: K\ If this is not the case, then return 
'optimum is arbitrarily close to K\ 

We first show that the algorithm runs in polynomial time. Step 1 takes polynomial time 
since maximizing (Fx over is equivalent to solving a linear program with size polynomially 
bounded in the size of <I>. Finally, Step 2 takes polynomial time due to Lemma |2.11| and 
Theorem 15.11 

Next, we prove the correctness of the algorithm. Correctness is obvious if the algorithm 
answers 'optimum: in Step 2. For the remaining cases, we need to make a couple of 
observations. Define S = {x £ M" | x satisfies $} and S' = {x £ M" | x satisfies $'}. 
Let Di, . . . , Dm denote the disequality literals appearing in <I>. Let Hi be the set {x G 
M" I X does not satisfy Di}, 1 < i < m, and note that each Hi is a hyperplane. 

Observation 1. The formula = <I> A Di A ■ ■ ■ A Dm is satisfiable. 



■'Note that if we are given an instance of CSP((riin, . . . , i?;)), then it can be transformed into an 
equivalent Horn-DLR formula in polynomial time since there is only a finite number of relations in the given 
structure. Hence, there is no loss of generality in considering Horn-DLR formulas instead of CSP instances. 
Also note that the resulting formula is (up to a multiplicative constant depending on the structure) of the 
same size as the CSP instance. 
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Otherwise, S C Hi U • • - UHm- The set S is essentially convex and each is a variety, 
so there exists an 1 < i < m such that S Hi hy Proposition |3.3[ Consequently, ^ A Di is 
not satisfiable which contradicts the fact that ^ has property (*). 

Observation 2. For every e > there is a y G S" satisfying \c^w — c^y\ < e. 

Let d{-, •) denote the Euclidean distance in M", i.e., d{a, b) = \/Yl^=i{(^i ~ ^lY^ ^'^^ 1 1 " 1 1 
the corresponding norm, i.e., ||a|| = V a^a. 

Arbitrarily choose a point z that satisfies <I>^; this is always possible by Observation 1. 
Consider the line segment L between z and w. Note the following: if is a hyperplane in 
M", then either H intersects L in at most one point or L C Also note that w,z € S' 
and S' is convex so L C 5". Arbitrarily choose a clause C G and assume C = {pi{x) ^ 
V • • • V pfc(x) ^ y po{x) < 0). Assume that there exist two distinct points a,b £ L such 
that pi{a) = pi{b) = 0. If so, then every point c £ L satisfies pi{c) = 0. This is not possible 
since z £ L satisfies Di A • • • A Dm, and in particular pi{z) / 0. Hence, at most one point 
c £ L satisfies p{c) = 0, and c is the only point in L that potentially does not satisfy the 
clause C. This implies that only finitely many points in L do not satisfy ^" , and it follows 
that for every 5 > there is a point y £ S such that d{w, y) < S. 

We proceed by showing that if w,y £ M" and d{w,y) = d, then Ic^w — c^y\ < ||c|| • d. 
This follows from the Cauchy-Schwarz inequality (that is, |a^6| < ||a|| • ||5|| for vectors a,b 
in M"): 

\c^u! — c^y\ = \c^{w — y)\ < \\c\\ ■ \\w — y\\ = \\c\\ ■ d{w,y) = ||c|| • d. 

To find a vector y that satisfies Ic^w — c^y\ < e, we simply choose y £ S such that 
d{'w, y) < we know that such a y exists by the argument above. 

If the algorithm outputs 'unbounded' in Step 2, then arbitrarily choose a sufficiently 
large number k and note that there exists a vector w £ S' such that c^w > k. By Obser- 
vation 2, there exists a vector y £ S such that \c^w — c^y\ < 1. Hence, S has unbounded 
solutions, too. 

Assume finally that the algorithm answers 'optimum is arbitrarily close to in Step 3; 
Observation 2 immediately proves correctness in this case. □ 

6. Open Problems 

The most prominent open question is whether there are there are essentially convex relations 
S with a first-order definition in (M;*,+) such that CSP((riin, 5)) is NP-hard. Resolving 
this question is probably difficult, since the following closely related problem is of unknown 
computational complexity: 

Feasibility of Convex Polynomial Inequalities 

INPUT: A set of variables V, a set of polynomial inequalities each of which defining a con- 
vex set; the coefficients of the polynomials are rational numbers where the numerators and 
denominators are represented in binary. 

QUESTION: Is there a point in M'^I that satisfies all inequalities? 

One may note that the problems we have considered could be easier since they are 
defined over finite constraint languages. On the other hand, convexity is much more restric- 
tive than essential convexity; moreover, we are only given polynomial inequalities (there 



24 



M .BODIRSKY, P. JONSSON, AND T. V. OERTZEN 



are convex semi-algebraic relations that cannot be defined as the intersection of convex 
polynomial inequalities). Still, the computational complexity of the Feasibility Problem of 
Convex Polynomial Inequalities is open. 

Convex semi- algebraic relations are of particular interest in the quest for efficiently 
solvable semi-algebraic constraint languages because of a conjectured link to semidefinite 
programming. Every semidefinite representable set is convex and semi- algebraic. Recently, 
Helton, Vinnikov and Nie showed that the converse statement is true in surprisingly many 



cases and conjectured that it remains true in general 18 
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